Improving deep speech denoising by Noisy2Noisy signal mapping
نویسندگان
چکیده
Existing deep learning-based speech denoising approaches require clean signals to be available for training. This paper presents a approach improve in real-world audio environments by not requiring the availability of as reference training mode. A fully convolutional neural network is trained using two noisy realizations same signal, one used input and other target network. Two signal are generated mid-side stereo microphone. Extensive experimentations conducted show superiority developed over conventional supervised based on four commonly performance metrics well subjective testing.
منابع مشابه
Speech enhancement based on deep denoising autoencoder
We previously have applied deep autoencoder (DAE) for noise reduction and speech enhancement. However, the DAE was trained using only clean speech. In this study, by using noisyclean training pairs, we further introduce a denoising process in learning the DAE. In training the DAE, we still adopt greedy layer-wised pretraining plus fine tuning strategy. In pretraining, each layer is trained as a...
متن کاملExperiments on deep learning for speech denoising
In this paper we present some experiments using a deep learning model for speech denoising. We propose a very lightweight procedure that can predict clean speech spectra when presented with noisy speech inputs, and we show how various parameter choices impact the quality of the denoised signal. Through our experiments we conclude that such a structure can perform better than some comparable sin...
متن کاملLearning a speech manifold for signal subspace speech denoising
We present a method for learning a low-dimensional manifold for speech from clean speech samples in high-dimensional space. Using this manifold, we perform speech denoising by projecting noisy speech onto the manifold to remove nonspeech components. This method of denoising classifies our algorithm as a signal subspace denoising method, where highdimensional noisy data is projected onto the sig...
متن کاملDeep Factorization for Speech Signal
Speech signals are complex intermingling of various informative factors, and this information blending makes decoding any of the individual factors extremely difficult. A natural idea is to factorize each speech frame into independent factors, though it turns out to be even more difficult than decoding each individual factor. A major encumbrance is that the speaker trait, a major factor in spee...
متن کاملDeep Denoising Auto-encoder for Statistical Speech Synthesis
This paper proposes a deep denoising auto-encoder technique to extract better acoustic features for speech synthesis. The technique allows us to automatically extract low-dimensional features from high dimensional spectral features in a non-linear, data-driven, unsupervised way. We compared the new stochastic feature extractor with conventional mel-cepstral analysis in analysis-by-synthesis and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied Acoustics
سال: 2021
ISSN: ['0003-682X', '1872-910X']
DOI: https://doi.org/10.1016/j.apacoust.2020.107631